TSAB - Web Interface for Transcribed Speech Collections
نویسندگان
چکیده
This paper describes a new web interface for accessing large transcribed spoken data collections. The system uses automatic or manual time-aligned transcriptions with speaker and topic segmentation information to present structured speech data more efficiently and make accessing relevant speech data quicker. The system is independent of the underlying speech processing technology. The software is free and open-source.
منابع مشابه
Text-Style Conversion of Speech Transcript into Web Document for Lecture Archive
It is very significant to the knowledge society to accumulate spoken documents on the web. However, because of the high redundancy of spontaneous speech, the faithfully transcribed text is not readable on an Internet browser, and therefore not suitable as a web document. This paper proposes a technique for converting spoken documents into web documents for the purpose of building a speech archi...
متن کاملA Web Application for Automated Dialect Analysis
Sociolinguists are regularly faced with the task of measuring phonetic features from speech, which involves manually transcribing audio recordings – a major bottleneck to analyzing large collections of data. We harness automatic speech recognition to build an online end-to-end web application where users upload untranscribed speech collections and receive formant measurements of the vowels in t...
متن کاملChild phonology analyzer: Processing and analyzing transcribed speech
This paper describes two algorithms for analyzing transcribed speech corpora: (1) identification of phonological processes, and (2) phonological queries. The algorithms are implemented in Visual Basic for Applications for Microsoft Excel, thus exploiting Excel’s mass‐calculation capabilities to analyze large corpora quickly. The user interface features a set of editable tables that contain defi...
متن کاملQuerying Xml Document Collections
In this paper we describe a query interface towards XML document collections. External schema annotation in RDF contains information used to dynamically build the interface tailored to the user’s characteristics and to the document structure, as described by its XML Schema. The interface makes the user aware of structure semantics, so supporting her/him in formulating semantically correct queri...
متن کاملEvaluating Speech-Driven Web Retrieval in the Third NTCIR Workshop
Speech recognition has of late become a practical technology for real world applications. For the purpose of research and development in speech-driven retrieval, which facilitates retrieving information with spoken queries, we organized the speech-driven retrieval subtask in the NTCIR-3 Web retrieval task. Search topics for the Web retrieval main task were dictated by ten speakers and were reco...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2011